PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information
TF ID Pavir.7KG302300.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; PACMAD clade; Panicoideae; Panicodae; Paniceae; Panicinae; Panicum
Family HD-ZIP
Protein Properties Length: 792 aa    MW: 85396 Da    pI: 6.6102
Description HD-ZIP family protein
Gene Model
Gene Model ID          Type     Source   Coding Sequence
Pavir.7KG302300.1.p    genome   JGI      View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  64.8   1.2e-20  99     155  1          57
                          TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHHC CS
             Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakekk 57 
                          +++ +++t++q+++Le++F++ ++p++++r++L+k+lgL+ rqVk+WFqNrR+++k+
  Pavir.7KG302300.1.p  99 KKRYHRHTPHQIQQLEAMFKEWPHPDEKQRADLSKRLGLEPRQVKFWFQNRRTQMKN 155
                          678899************************************************995 PP

2    START     173.8  1e-54    308    533  2          205
                          HHHHHHHHHHHHHHHC-TT-EEEE........EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S. CS
                START   2 laeeaaqelvkkalaeepgWvkss........esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdetla. 77 
                          la +a++elvk+a+ +ep+W  s+        e +n +e+ ++f +  +     + +ea+r+sg+v  ++a lve ++d + +W+ ++  
  Pavir.7KG302300.1.p 308 LAMRAMDELVKMAQMNEPLWIPSVsspgsstmETLNWKEYSKTFLPCVGvkpigFVSEASRESGIVNIDSAALVEFFMDER-RWSDMFSc 396
                          6789****************9999887777776777777777776644499*****************************9.******** PP

                          ...EEEEEEEECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEE CS
                START  78 ...kaetlevissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepk 157
                             k++t+e is g      gal lm+aelq+lsplvp R+++f+R++ ql +g+w++vdvS+d  +  +    +v +++lpSg++++++
  Pavir.7KG302300.1.p 397 ivaKVSTIEEISAGvagsrdGALLLMQAELQVLSPLVPrREVTFLRFCNQLAEGVWAVVDVSIDGLERDQCLVTSVNCRRLPSGCVVRET 486
                          ****************************************************************9888877999**************** PP

                          CTCEEEEEEEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXX CS
                START 158 snghskvtwvehvdlkgrlphwllrslvksglaegaktwvatlqrqce 205
                          +ng +kvtwveh+++ ++++h+l+++l++sgla ga +w+atlqrqce
  Pavir.7KG302300.1.p 487 PNG-CKVTWVEHTEYHEASVHQLYKPLLRSGLALGAGRWLATLQRQCE 533
                          ***.*******************************************9 PP

Protein Features
3D Structure
Database         Entry ID          E-value    Start  End  InterPro ID  Description
SuperFamily      SSF46689          5.01E-21   83     156  IPR009057    Homeodomain-like
Gene3D           G3DSA:1.10.10.60  5.8E-23    86     150  IPR009057    Homeodomain-like
PROSITE profile  PS50071           17.977     96     156  IPR001356    Homeobox domain
SMART            SM00389           7.8E-19    97     160  IPR001356    Homeobox domain
CDD              cd00086           9.81E-20   98     157  No hit       No description
Pfam             PF00046           4.2E-18    99     154  IPR001356    Homeobox domain
PROSITE pattern  PS00027           0          131    154  IPR017970    Homeobox, conserved site
PROSITE profile  PS50848           44.108     298    537  IPR002913    START domain
SuperFamily      SSF55961          5.19E-32   301    534  No hit       No description
CDD              cd08875           2.43E-105  302    533  No hit       No description
SMART            SM00234           3.7E-44    307    534  IPR002913    START domain
Pfam             PF01852           3.8E-47    308    533  IPR002913    START domain
SuperFamily      SSF55961          9.77E-19   555    761  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0008289  Molecular Function  lipid binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 792 aa
MSFGDLLGGV GDAAAVPYPP YGAFASSPAL SLAVADAGRR RDGSGERAGS VPRGGGGGNA  60
KDAPEAEDDT RSSPMSGHLD VVLAGGGEDG EGGNPRKRKK RYHRHTPHQI QQLEAMFKEW  120
PHPDEKQRAD LSKRLGLEPR QVKFWFQNRR TQMKNQMERH ENTLLKQEND KLRAENLSIR  180
VAMRDAACSG CGGPALLGEM SLEEHHLRLE NARLRDELTR VCALTAKFIG KPLSPMALPP  240
VQQPHPMPGS SLDLAVTCVG SVPPSTMPVS TISELAGSVS SQMGTVITPV VTTPLAMGSG  300
DKSMFVQLAM RAMDELVKMA QMNEPLWIPS VSSPGSSTME TLNWKEYSKT FLPCVGVKPI  360
GFVSEASRES GIVNIDSAAL VEFFMDERRW SDMFSCIVAK VSTIEEISAG VAGSRDGALL  420
LMQAELQVLS PLVPRREVTF LRFCNQLAEG VWAVVDVSID GLERDQCLVT SVNCRRLPSG  480
CVVRETPNGC KVTWVEHTEY HEASVHQLYK PLLRSGLALG AGRWLATLQR QCEGLAILVS  540
SVAVPEHDSS AVPLEGKRSL LKLAERMMEN FCAGVSASSA EWSKLDVLTG SMRKDVRVMV  600
RKSVDEPGVP PGVVLSAATA VWMPVTPERL FNFLRNEELR AEWDILSNGG PMQQMLRIAK  660
GQLDGNSVTL LRADPTNTHL NSIFILQETC TDKSGAMVVY APVDFPAMQL VMGGGDSTYV  720
ALLPSGFAIL PGGSSAGGVG HKTSGSLLTV AFQILVNSQP TAKLTLESVD TVYSLISCTI  780
EKIKASLHCE V*
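
The domain coordinates in the Signature Domain table can be checked against this sequence block. The following is a minimal sketch, not part of PlantTFDB: the helper names `clean_sequence` and `region` are hypothetical, and it assumes that Start and End are 1-based, inclusive residue positions. Only the first three lines of the block (residues 1-180) are reproduced, which is enough to cover the Homeobox domain at 99-155.

```python
import re

# First three lines of the protein sequence block from this record
# (residues 1-180), copied verbatim with spacing and running counts.
RAW_BLOCK = """
MSFGDLLGGV GDAAAVPYPP YGAFASSPAL SLAVADAGRR RDGSGERAGS VPRGGGGGNA  60
KDAPEAEDDT RSSPMSGHLD VVLAGGGEDG EGGNPRKRKK RYHRHTPHQI QQLEAMFKEW  120
PHPDEKQRAD LSKRLGLEPR QVKFWFQNRR TQMKNQMERH ENTLLKQEND KLRAENLSIR  180
"""

# Query row of the Homeobox alignment shown in the Signature Domain section.
ALIGNED_ROW = "KKRYHRHTPHQIQQLEAMFKEWPHPDEKQRADLSKRLGLEPRQVKFWFQNRRTQMKN"


def clean_sequence(block: str) -> str:
    """Keep letters only: drops spaces, running counts, and a trailing '*'."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end (assumed 1-based, inclusive)."""
    return seq[start - 1:end]


seq = clean_sequence(RAW_BLOCK)
homeobox = region(seq, 99, 155)

print(len(seq), len(homeobox))
print(homeobox == ALIGNED_ROW)
```

Under that coordinate assumption, the extracted slice can be compared directly with the query row of the alignment; the same `region` call applies to the START domain (308-533) once the full sequence block is pasted in.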
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    94     100  PRKRKKR
2    95     99   RKRKK
Expression -- UniGene
UniGene ID  E-value  Expressed in
Pvr.25879   1e-104   callus
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_004978185.1  0.0      PREDICTED: homeobox-leucine zipper protein ROC4
Swissprot  Q7Y0V9          0.0      ROC4_ORYSJ; Homeobox-leucine zipper protein ROC4
TrEMBL     K3Y5A8          0.0      K3Y5A8_SETIT; Uncharacterized protein
STRING     Si009396m       0.0      (Setaria italica)
Orthologous Group
Lineage   Orthologous Group ID  Taxa Number  Gene Number
Monocots  OGMP120337116
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G61150.1  0.0      homeodomain GLABROUS 1